Resampling Imbalanced Network Intrusion Datasets to Identify Rare Attacks
نویسندگان
چکیده
This study, focusing on identifying rare attacks in imbalanced network intrusion datasets, explored the effect of using different ratios oversampled to undersampled data for binary classification. Two designs were compared: random undersampling before splitting training and testing after data. study also examines how oversampling/undersampling affect forest classification rates datasets with minority dataor attacks. The results suggest that gives better rates; however, oversampling BSMOTE allows use lower
منابع مشابه
Training Network Managers to Recognise Intrusion Attacks
One of the major challenges facing the e-Business community, and the broader telecommunications network world, is the threat of electronic attack. Of the sub-categories of such attacks, the denial of service attack, in which the intruder’s objective is to prevent legitimate users from accessing some or all of an organisation’s computing resource, regularly creates headlines in the popular press...
متن کاملResampling Methods Improve the Predictive Power of Modeling in Class-Imbalanced Datasets
In the medical field, many outcome variables are dichotomized, and the two possible values of a dichotomized variable are referred to as classes. A dichotomized dataset is class-imbalanced if it consists mostly of one class, and performance of common classification models on this type of dataset tends to be suboptimal. To tackle such a problem, resampling methods, including oversampling and und...
متن کاملClassification in Imbalanced Datasets
In this thesis we study the classification task in the presence of class imbalanced data. This task arises in many applications when we are interested in the under-represented (minority) classes. Examples of such applications are related to fraud detection, medical diagnosis and monitoring, text categorization, risk management, information retrieval and filtering. Although there exist many stan...
متن کاملApplying Support Vector Machines to Imbalanced Datasets
Support Vector Machines (SVM) have been extensively studied and have shown remarkable success in many applications. However the success of SVM is very limited when it is applied to the problem of learning from imbalanced datasets in which negative instances heavily outnumber the positive instances (e.g. in gene profiling and detecting credit card fraud). This paper discusses the factors behind ...
متن کاملImbalanced Datasets: from Sampling to Classifiers
Classification is one of the most fundamental tasks in the machine learning and data-mining communities. One of the most common challenges faced when trying to perform classification is the class imbalance problem. A dataset is considered imbalanced if the class of interest (positive or minority class) is relatively rare as compared to the other classes (negative or majority classes). As a resu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Future Internet
سال: 2023
ISSN: ['1999-5903']
DOI: https://doi.org/10.3390/fi15040130